Towards real-time 3-D monocular visual tracking of human limbs in unconstrained environments
Authors
Abstract
The 3-D visual tracking of human limbs is fundamental to a wide array of computer vision applications, including gesture recognition, interactive entertainment, biomechanical analysis, vehicle driver monitoring, and electronic surveillance. Limb tracking is complicated by occlusion, depth ambiguities, rotational ambiguities, and high levels of noise caused by loose-fitting clothing. We attempt to solve the 3-D limb tracking problem using only monocular imagery (a single 2-D video source) in largely unconstrained environments. The approach presented is a step towards full real-time operation. The described system is a complete visual tracking system that incorporates target detection, target model acquisition/initialization, and target tracking components into a single, cohesive, probabilistic framework. The presence of a target is detected, using visual cues alone, by recognizing an individual performing a simple pre-defined initialization cue. The physical dimensions of the limb are then learned probabilistically until a statistically stable model estimate has been found. The appearance of the limb is learned in a joint spatial-chromatic domain, which combines normalized color data with spatial constraints in order to model complex target appearances. Target tracking is performed within a Monte Carlo particle filtering framework, which is capable of maintaining multiple state-space hypotheses and propagating ambiguity until less ambiguous data is observed. Multiple image cues are combined within this framework in a principled Bayesian manner. The target detection and model acquisition components perform at near real-time frame rates and are shown to accurately recognize the presence of a target and initialize a target model specific to that user.
The target tracking component has demonstrated exceptional resilience to occlusion and temporary target disappearance and contains a natural mechanism for the trade-off between accuracy and speed. At this point, the target tracking component performs at sub real-time frame rates, although several methods to increase the effective operating speed
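The particle filtering idea named in the abstract can be illustrated with a minimal sketch. This is a generic bootstrap particle filter over a 1-D state, not the authors' tracker: the random-walk motion model, the Gaussian likelihood, the parameters `motion_std` and `obs_std`, and all function names are assumptions chosen for illustration. A frame with no usable observation (for example during occlusion) is passed as `None`, in which case the hypotheses are only diffused and the ambiguity is carried forward. The particle count is the knob that trades accuracy against speed.

```python
import numpy as np

def systematic_resample(weights, rng):
    """Return particle indices drawn in proportion to the weights."""
    n = len(weights)
    positions = (rng.random() + np.arange(n)) / n
    cumulative = np.cumsum(weights)
    cumulative[-1] = 1.0  # guard against floating-point round-off
    return np.searchsorted(cumulative, positions)

def particle_filter_step(particles, weights, observation, rng,
                         motion_std=0.1, obs_std=0.2):
    """One predict/update/resample cycle (illustrative models only)."""
    # Predict: diffuse every hypothesis with random-walk motion noise.
    particles = particles + rng.normal(0.0, motion_std, size=particles.shape)

    # Update: reweight by an assumed Gaussian likelihood (Bayes' rule).
    # Skipped when the target is unobserved, so ambiguity is propagated.
    if observation is not None:
        log_lik = -0.5 * ((observation - particles) / obs_std) ** 2
        weights = weights * np.exp(log_lik - log_lik.max())
        weights = weights / weights.sum()

    # Resample only when the effective sample size has collapsed.
    ess = 1.0 / np.sum(weights ** 2)
    if ess < 0.5 * len(particles):
        idx = systematic_resample(weights, rng)
        particles = particles[idx]
        weights = np.full(len(particles), 1.0 / len(particles))
    return particles, weights

def estimate(particles, weights):
    """Weighted mean of the particle set."""
    return float(np.sum(particles * weights))

# Usage: track a constant state, with three unobserved frames.
rng = np.random.default_rng(0)
particles = rng.uniform(-2.0, 2.0, size=500)
weights = np.full(500, 1.0 / 500)
for t in range(30):
    obs = None if 10 <= t < 13 else 1.0
    particles, weights = particle_filter_step(particles, weights, obs, rng)
print(estimate(particles, weights))
```

If several image cues were available and assumed conditionally independent given the state, their log-likelihoods could be summed in the update step before reweighting; that independence assumption is ours, not a claim about the paper's cue model.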
Similar resources
Automatic Model Initialization for 3-D Monocular Visual Tracking of Human Limbs in Unconstrained Environments
Automated 3-D tracking of the human body is a necessary prerequisite for interactive entertainment applications, video security systems, computer animation, bio-mechanical analysis and human-computer interaction (e.g., gesture recognition). Currently, technologies use artificial markers and a feature tracking methodology to recover the target poses. In addition, most tracking systems alter the w...
3D Model-Based Tracking of the Human Body in Monocular Gray-Level Images
This paper presents a model-based approach to monocular tracking of human body using a non-calibrated camera. The tracking in monocular images is realized using a particle filter and an articulated 3D model with a cylinder-based representation of the body. In modeling the visual appearance of the person we employ appearance-adaptive models. The predominant orientation of the gradient combined w...
Real-Time 3-D Tracking of the Human
People are the central element in the whole enterprise of multimedia and communications and thus visual interpretation of humans and their movements is an important problem for computers. Here we describe a monocular and a stereo system for recovering 3-D descriptions of humans from images in real time. We discuss the technical details and present several applications using the systems for huma...
Eye-Tracking Method’ Usage for Understanding the Cognitive Processes in Multimedia Learning
Introduction: Designing multimedia learning environments should consist of the evidence-based study and principals about the human learning process. Eye tracking is a way based on the learner processing of learning materials which presented in multimedia learning environments. The aim of the study was to examine the use of the eye-tracking method to investigate the cognitive processes in m...
Real-time 3d Multiple Human Tracking with Robustness Enhancement through Machine Learning
This paper presents a novel and robust vision-based real-time 3D multiple human tracking system. It is capable of automatically detecting and tracking multiple humans in real-time even when they occlude each other. Furthermore, it is robust towards drastically changing lighting conditions. The system consists of 2 parts, 1. a vision based human tracking system using multiple visual cues with a ...
Journal:
- Real-Time Imaging
Volume 11, issue
Pages -
Publication date 2005